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Abstract 

An extensive literature in economics and social science addresses contests, in which players 
compete to outperform each other on some measurable criterion, often referred to as a player's 



^ | score, or output. Players incur costs that are an increasing function of score, but receive prizes 

for obtaining higher score than their competitors. In this paper we study finite games that are 
discrctized contests, and the problems of computing exact and approximate Nash equilibria. 
Our motivation is the worst-case hardness of Nash equilibrium computation, and the resulting 
interest in important classes of games that admit polynomial-time algorithms. For games that 

■ have a tie-breaking rule for players' scores, we present a polynomial-time algorithm for com- 
00 ' puting an exact equilibrium in the 2-player case, and for multiple players, a characterization of 

■ Nash equilibria that shows an interesting parallel between these games and unrestricted 2-player 
games in normal form. When ties are allowed, via a reduction from these games to a subclass 
of anonymous games, we give approximation schemes for two special cases: constant-sized set 

' of strategies, and constant number of players. 

1 Introduction 

C$ , We consider a class of finite games, and the problem of computing their exact and approximate 

Nash equilibria. In these games, each pure strategy of a player corresponds to a score, i.e., a level 
of attainment (on some measurable criterion) . Each score also has an associate cost of effort (which 
we assume here is player-specific), where a higher scores requires higher effort. If all players select 
strategies, then their payoffs are obtained as follows. The players are ranked according to the 
scoreqj they selected, and prizes are awarded to players according to their position in the ranking. 
The overall payoff to a player is the value of the prize he wins, minus the cost of effort for the 
strategy he chose. Consequently the players face a trade-off between the cost of a strategy, and its 
effectiveness at winning prizes. We call these games competitiveness-based ranking games. 

By way of illustration, consider a set of athletes who are training for a race. Each athlete 
spends time and effort in training, and has an increasing function that maps this upfront cost to 



"This work was supported by EPSRC grant EP/G069239/1 "Efficient Decentralised Approaches in Algorithmic 
Game Theory." A preliminary version of this paper appeared as |19j . 

*We are following Siegel [33] in using "score" to refer to this quantity; it is also called "output" in labor-market 
contests ([215] and subsequent papers), or the "bid" [27] . 
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performance (speed in the race). His total utility is the value of the prize that he wins, minus the 
cost of making this initial effort. Note that the prize is awarded based on his speed relative to other 
competitors, with no consideration given to his speed taken in isolation. 

1.1 Motivation 

For unrestricted normal-form games, the PPAD-completeness results of |13[ 15] suggest that a Nash 
equilibrium is hard to compute in the worst case. Faced with a worst-case hardness result, there are 
two general routes to computationally positive results. We may move to the problem of computing 
a weaker solution concept, such as approximate Nash equilibrium (defined in detail in Section \2.2\i 
where for some e > a player's incentive to change strategy is at most e. For this direction, 
progress has been rather limitedJl The main open problem of this line research is to determine 
the existence of a polynomial-time approximation scheme (PTAS); it is known from [9] that a fully 
polynomial-time approximation scheme (FPTAS) is as hard as computing an equilibrium exactly. 
The alternative route to positive results is to note that the PPAD-hardness of computing a Nash 
equilibrium relies on a class of highly- artificial games, suggesting that we instead focus on special 
cases that represent important and well-motivated games. When we find that (exact or approxi- 
mate) Nash equilibria can indeed be efficiently computed for such a class of games, this overcomes 
the complexity-theoretic objection to using Nash equilibrium as a solution concept. Various classes 
of potential games, games on congestion networks, and games on graphs (representing social net- 
works, or networks with local interactivity) do indeed have efficiently computable Nash equilibria, 
see |31| for an overview. The contests for prizes that we study here have a rich economics literature, 
but we believe that this is the first paper to analyze the problem of computing their Nash equi- 
libria. While we find that certain algorithms for anonymous and polymatrix games are applicable 
to special cases, we also exhibit novel polynomial-time algorithms and approximation schemes for 
games in this class. 

The kind of games that we study here are often constructed by a competition organizer (for 
example, sporting contests, or reward structures in organizations). Hence there is an associated 
mechanism design problem of structuring the contest in such a way as to elicit competitive behavior 
from the participants. Efficient computation of Nash equilibria for these games should help with 
the problem of determining, for example, a good allocation of prizes in such a contest. We noted 
that the strategies available to players have associated levels of effort and score, so in a given Nash 
equilibrium it is straightforward to compute, for example, the total effort or score of the players. 
In this way we have a well-defined measure of goodness of a Nash equilibrium. 

Notice that for the players, it is socially optimal to use strategies having minimal effort, since all 
prizes must be allocated, irrespective of what the players have achieved (it is relative, not absolute, 
achievement that gets rewarded). However, in a Nash equilibrium, players will typically be more 
than minimally competitive. Competitiveness amongst the players results in a positive externality 
— in the context of spectator sports, the spectators prefer to see a well-run race, or in the context 
of research contests [8] (such as the DARPA Grand Challenge), competitiveness leads to research 
progress. We continue by reviewing the background literature in more detail. 

2 For polynomial-time algorithms the lowest e that has been achieved for bimatrix games is just over | [35], and 
for well-supported approximate equilibria (for which there is a further constraint that no positive probability may 
be allocated to any pure strategy that is worse than the best response by more than e) it is just below | [TS]. For 
normal-form games with more than 2 players, known results are even weaker [5] [2]] ■ These e- values assume all payoffs 
lie in the range [0, 1], so that achieving e = 1 is trivial, while e = corresponds to an exact equilibrium. 
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1.2 Related work 



As we noted, there is an extensive literature on contests where players exert effort with the aim of 
outperforming their rivals. The new aspect of this work is our focus on the design of polynomial- 
time algorithms for computing outcomes (equilibria) of these games. Most previous work considers 
continuous games, where players have a continuum of actions to choose from, with real-valued 
functions from effort to score. Here we study a discretized version so that we have finite games. 
These finite games can still serve as approximations to the continuous ones, and the resulting 
discretized functions benefit from a natural representation (as a list of pairs of values associating 
effort cost with score, for each player) so that we have a clear notion of "input size" of a problem 
instance, as needed in the context of polynomial-time algorithms. 

The most closely-related class of finite games in the literature, appears to be the ranking games 
of Brandt et al. [3]. In such a game, the outcome of any pure-strategy profile is a rank-ordering 
of the players, and a player's utility is a decreasing function of his position in the rank-ordering. 
However, the games studied in [3] allow an arbitrary dependence of rankings on the pure-strategy 
profiles that may cause them. There is no requirement that certain pure strategies are more likely 
than others to raise a player's ranking. This leads to computational hardness results, notably that 
unrestricted ranking games are hard, even for just 3 players [3]. 

Anonymous games represent a class of finite games that relate to the discretized contests con- 
sidered here. Anonymous games are games where a player's payoff depends on his own action 
and on the distribution of actions taken by the other players, but not on the identities of the 
players who chose each action. Anonymous games admit polynomial-time approximation schemes 
(PTAS's) \12\ HH] but may be PPAD-complete to solve exactly. The algorithms for anonymous 
games can be applied to an interesting subclass of the discretized contests that we study here. In 
particular they apply to a special case in which all players have the same (finite) set of score levels 
available to them, with prizes being shared in the event of ties (which is a standard assumption in 
much of the literature). 

We next mention some of the more classical literature on continuous contests. An influential 
line of work is the literature on rent-seeking problems, initiated by Tullock [36J. These are problems 
in which players compete to receive favorable treatment from a regulator. Another large body of 
literature initiated by Lazear and Rosen [26] has focused on contests in labor markets. Lazear and 
Rosen study the merits of rank-based prizes (as an alternative to paying a piece rate) as a means of 
incentivizing effort by workers in organizations. These models incorporate a random noise process 
that affects the selection of the winner. In a Tullock contest, the probability of winning is the 
amount of effort exerted by a player, divided by the total effort. Besides being a model of artificial 
competition, games of this kind are a model for competition for status within society [23| . 

Closer to the setting of this paper, are contests where the outcome is a deterministic function 
of effort spent by the players. Siegel |33j studies properties of the Nash equilibria of (continuous) 
contests, in a setting where all prizes have the same value, so that a contestant either wins or 
loses. He analyses expected payoffs to players, and also participation (which refers to the decision 
by a player to make more than a minimal expected effort). An important special case is first-price 
all-pay auctions, where an item (the prize) is sold to the bidder who makes the highest offer, but 
all bidders must pay, even if they lose. In an all-pay auction, there is a sense in which (cost of) 
effort is the same thing as score. Where players have different valuations for the prize, Baye et 
al. [1] show that this can be interpreted as score being some linear function of effort, with scaling 
factor proportional to the valuation. For the case of two players, Hillman and Riley [22] study the 
cumulative probability distributions of effort choices in the unique Nash equilibrium of the game; 
the uniqueness of the equilibrium is proved in [T] . The latter paper also extends the analysis to the 
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case of first-price all-pay auctions with many players. A survey of results and models considered in 
contests (without noise) is contained in [25] . In general, our games can be viewed as a discretized 
version of contests without noise and could be used as an approximation to these games, provided 
the discretization is fine enough. 

As we noted, this model has a corresponding mechanism design problem of allocating values to 
a set of prizes in a contest (so as to maximize total effort or score by the players). A well-known 
paper of Moldovanu and Sela [27] analyses this question in a setting where the players' functions 
(from effort to score) differ from each other in being linearly scaled by each player's ability. They 
obtain results for allocating value to prizes, given prior distributions over the players' abilities. ( [27] 
also begins with an informative and readable discussion and motivation for the study of contests.) 
Moldovanu et al. [29] study how to elicit maximal effort via the selection of rank-based status 
classes, where a player gains utility from having others assigned to lower classes, and disutility 
from others in higher classes. (In [29] each player has a privately- known ability, relating score to 
cost of effort, which has been generated by a probability distribution. The solution concept is a 
shared function mapping ability to effort.) Their model relates to our result for linear-prize games 
(Section \3.3\i . in which the value of taking k-th. place in the ranking is linear (decreasing) as a 
function of k; that essentially corresponds to d status classes each of size 1, d denoting the number 
of players. Note that the setting we study here is different, in that players have arbitrary discretized 
functions from effort to score, and the functions are all commonly known. An alternative approach 
to maximizing effort [28] considers how to divide the competitors into sub-contests whose winners 
then compete in a final round. 

Szymanski [M] considers the application of the theory of contests to the design of sporting 
competitions. The allocation of prizes in dynamic sport contests in which players determine their 
efforts at different stages of the game (e.g., at the beginning of each half in a soccer game) is 
considered in [7J. There, the focus is on comparing rank-based versus score-based prizes when 
spectators care about contestants' efforts or about the "suspense" of the game (see also [27] 17]). 
Cohen et al. pT] study a version of the contest design problem where the prize fund may be chosen 
by the designer, who wants to maximize the effort elicited, with prizes representing the price paid 
for effort. A related line of research }20t [6] has addressed from a game-theoretic perspective, the 
impact of the point scoring system on offensive versus defensive play in the UK Premier League; 
our concern here is slightly different, being focused on highly competitive versus weakly competitive 
play. 

1.3 Our Contribution 

Some of our algorithms apply specifically to an interesting special case of games without ties where 
players cannot share a prize as a result of obtaining the same scorell For games without ties, 
we give an efficient algorithm for the 2-player case, and for the multi-player case we show how to 
compute the probabilities in a Nash equilibrium if the supports of the players' distributions are 
known (Theorem [2]). (This shows an interesting parallel with general 2-player bimatrix games, 
where a Nash equilibrium having known support can be efficiently computed.) When ties are 
possible — in the literature, the standard assumption is that prizes are shared — it is convenient to 
reduce these games to equivalent score- symmetric games (Definition [2|) in which all players have 
the same available set of score values, but with player-dependent costs. The reduction incurs an 
increase in the number of strategies that is proportional to the number of players. For these games, 

3 As an example of this consider the competition amongst universities for places in a ranking, or league table. In 
such a ranking, it is necessary to list the names of institutions in a strict order; ties have to be broken somehow, and 
this is often done alphabetically by name of institution. 
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Table 1: Our algorithmic contributions. 



a special case of interest arises when we assume a constant limit on the number of these strategies, 
and the results of [121 E] can be used to provide a PTAS; here we give a simpler PTAS for such 
games. Table Q] gives our algorithmic results for both classes of games, together with linear-prize 
games (Section I3.3|) . a class of ranking games in which the prize for taking position k in the ranking 
is a linear function of k. 

2 Model, notation and some illustrative examples 

We work in a classical game-theoretic setting of a finite number of players, each with a finite 
number of actions, and we consider the problem of computing Nash equilibria, and approximate 
Nash equilibria, for these games. 

In Section f2. II we specify in detail the class of games that we study, and introduce some notation 
and terminology. Section 12.21 gives the background definitions of Nash and approximate Nash 
equilibrium. Section [2.31 shows some examples to illustrate various technical issues. 

2.1 Definition and Notation 

A prize refers to the reward that a player gains from obtaining a specified position in the ranking, 
and this relates directly to the standard usage of "first prize", "second prize" etc in competitions. 
We say that an action is "stronger" or "more competitive" than another one, if its score is higher. 
Any pair of actions are comparable in this sense, whether or not they belong to the same player. 

We next formally define the class of games we introduce and study in this paper. Throughout, 
we let d denote the number of players in a game, and n the number of strategies available to each 
player. 

Definition 1 In a competitiveness-based ranking game, the j-th pure strategy of player i, denoted 
atj, has associated a cost c*- and a score s*. We assume they are indexed in increasing order of 
competitiveness so that, for all i, j, we have c*- < c* +1 and s* < s l - +l . Any pure-strategy profile 
results in a ranking of the players according to the scores. A player whose position in the ranking 
is k gets awarded the k-th prize, having value u&. Prizes are non-increasing with respect to ranks: 
Uk > Uk+i, for 1 < k < d, with the assumption that u\ > In the event of a tie (where two 
or more players obtain the same score and are ranked equal) the prizes that would result from tie- 
breaking are shared. The total payoff to a player will be the value of the prize he is awarded, minus 
the cost of the action selected by that player. 

4 This assumption simply rules out an uninteresting case in which competition is not adequately incentivised. 
Indeed, when u\ = Ud, the profile in which all players play their least competitive action is a dominant strategy 
equilibrium. 
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For the purpose of designing algorithms that search for a Nash equilibrium, we can assume 
without loss of generality that we have strict monotonicity of effort costs and scores, in the definition 
above. If two different actions (i) have the same cost and different scores then the stronger dominates 
the weaker and (ii) have the same score and different costs then the cheaper will dominate the more 
expensive. 

Next we define an interesting subclass of the games we consider. 

Definition 2 A score-symmetric game is a competitiveness-based ranking game in which all players 
have the same set of pure strategies (which we denote a±, . . . ,a n ) having the same scores (which we 
denote s\, . . . ,s n ). Costs remain player- specific, and c*- denotes the cost to player i of pure strategy 
a r 

2.2 Exact and approximate Nash equilibria 

Here we give the definitions of Nash equilibrium and approximate Nash equilibrium, also some 
further notation we use throughout. Let Si be the set of player i's pure strategies; Si = {dj}j. Let 
S = Si x . . . x Sd be the set of pure-strategy profiles, where recall d denotes the number of players. 
It is convenient to define S-i = S\ x . . . x Si-i x x . . . x 5 rf as the set of pure-strategy profiles 
of all players but i. 

A mixed strategy for player i is a distribution on Si, that is, real numbers x l j > for each 
strategy a* 6 Si such that SjeS; x ) = 1- A set of d mixed strategies (one for each player) is 
a mixed strategy profile. By u\ we denote the utility to player i in strategy profile s. A mixed 
strategy profile {x*- }j e s i5 i = 1, . . . , rf, is called a ( mixed) Nash equilibrium if, for each i, X] ses u l x s 
is maximized over all mixed strategies of i — where for a strategy profile s = (s\,..., sj) £ S, we 
denote by x s the product x\ ■ x 2 S2 ■ ■ ■ xf . (The notation x s naturally extends to strategy profiles 
s G S—i.) That is, a Nash equilibrium is a set of mixed strategies from which no player has a 
incentive to unilaterally deviate. It is well-known (see, e.g., [32]) that the following is an equivalent 
condition for a set of mixed strategies to be a Nash equilibrium: 

u) s x s > ^2 u), s x s ==> x l j, = 0. (1) 

The summation ^2 s€ g_ i Uj S x s in the above equation is the expected utility of player i if i plays pure 
strategy j and the other players use the mixed strategies {x^ }kes i , Nash's theorem [30] 

asserts that every game has a Nash equilibrium. 

We say that a set of mixed strategies x is an e- approximately well supported Nash equilibrium, 
or e-Nash equilibrium for short, if, for each i, the following holds: 

u%x s > u )'s x s + e x), = 0. (2) 
s6S_, ses_, 

Condition ([2]) relaxes that in ([1]) by allowing a strategy to have positive probability in the presence 
of another strategy whose expected payoff is better by at most e. 

2.3 Some examples 

We consider some examples that should be helpful in understanding the model and issues arising. 
Example [T] shows that the games we consider do not always have pure Nash equilibria. 
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Example 1 Consider two players; for i = 1,2 player i has two actions a\ and a\. Suppose the 
row player (player 1) is stronger than the column player in the sense that the column player only 
wins by playing a\ while the row player plays a\ — this can be achieved by setting si = 2, s\ = 3, 
s| = 4, s\ = 5. Suppose the costs are c\ = 0, c 2 = \ for both players i = 1,2, and we have a single 
prize worth 1, i.e., u\ = 1 and U2 = 0. We have payoff matrix: 





a\ 




a\ 


(1,0) 
&0) 


(o,|) 

V2' 2> 



It is easily checked that this game has no pure Nash equilibrium and that the unique equilibrium is 
the one in which both players mix uniformly. 

Example [2] is an anonymous game with binary actions (studied in [2], although [2] studies 
a continuum of players). The example shows that in this kind of game, there may be multiple 
equilibria, and the number of equilibria may be exponential in the number of players. 

Example 2 Consider a symmetric game with an even number d > 4 of players; a single prize 
worth 1 unit; each player i has two actions a\ and 02 with costs C\ = and C2 = c (we do not have 
a superscript to identify a player, since the games are symmetric). The prize will be shared between 
players who use 02, or all players if they all use a%. 

Notice first that for c £ (0, -j] , there is a pure equilibrium in which all players play 02- For 
c € Q, l) there is, by symmetry, a fully-mixed Nash equilibrium where all players play 02 with the 
same probability. This can be seen by the following argument. Suppose each player, other than the 
first, plays 02 with probability p. The first player has an incentive to play 02 which is decreasing 
in p. In particular, for p = 1 player 1 has no incentive to play 02, while for p = player 1 has an 
incentive to play 02- Then, by continuity, there exists a value of p, say p* , for which player 1 is 
indifferent between a\ and 02- In a profile in which all players play 02 with probability p* all players 
are indifferent by symmetry. 

Now put c = I — e, where e < d ^ 2( r We claim that there are also pure Nash equilibria where 
any subset of size | play pure 02 and the others play pure a\ . A player playing 02 obtains utility 
— c + § > 0/ no incentive to switch to a±. A player playing a\ obtains utility 0, and by switching to 
a2 would obtain utility — c+ 1/(^ + 1) = e — f + 3x2 < 0. Indeed there are also many mixed equilibria 
where a subset of the players play pure a\ and the other players all use the same probabilities. 

Observe that there are no Nash equilibria where players may mix with different probabilities - 
two such players would both be indifferent between a\ and 02, but their expected payoffs from playing 
ci2 would have to differ. 

The following example shows that there is no bound on the price of anarchy and on the price 
of stability in these games. 

Example 3 Consider a symmetric game with 2 players; a single prize worth 1 unit; each player i 
has two actions a\ and 02 with scores s\ < S2 and costs c\ = and C2 = 1/2 — e, for some small 
e > 0. The payoff for playing (02,02) is e for both players, and it is higher than the payoffs obtained 
by deviating from the strategy 02'- player 1 has a payoff ofO for strategy profile (a\, 02) and so does 
player 2 for strategy profile (02, a±). Thus, strategy profile (02,02) is a pure Nash equilibrium and 
its social welfare is 2e. Now, notice that 02 is actually a strictly dominant strategy for both players 
thus implying that no other action profile is a Nash equilibrium. The action profile that maximizes 
the social welfare is (a±,ai), and its social welfare is 1. Thus, the price of anarchy in this game 
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is l/(2e). Since this Nash equilibrium is unique, l/(2e) is also the price of stability of this game. 
Because e can be chosen arbitrarily small, both price of anarchy and price of stability are unbounded. 
Note that this game is essentially the Prisoner's Dilemma in which a\ is the collaborating strategy 
and a2 is the defecting one. 

3 Algorithms and proofs 

We start by noting some preprocessing steps that establish some useful assumptions that we can 
make without loss of generality. We continue in Section 13.21 by considering separately the special 
case where players cannot tie for a position in the ranking; this case would arise in competitions 
that have a tie-breaking rule, or where the score values s*- are all distinct. The reason for a focus 
on the tie-free case is that the analysis is simpler and the Nash equilibria turn out to have a special 
structure. Section [3731 applies a result of [15] for poly matrix games, to the special case where prize 
values decrease linearly as a function of rank position. 

In Section 13.41 we study the more general case where players may tie for a position in the 
ranking. We show that we can focus without loss of generality on an anonymous subclass of these 
games. Pure Nash equilibria of these games are studied in Section [3.4.21 In Section [3.4.31 we give 
a polynomial-time approximation scheme for games with a fixed number of strategies. Finally, in 
Section [3.4.41 we give a fully polynomial-time approximation scheme for the case of constantly-many 
players. 

3.1 Preprocessing 

Results about the computation of e-approximate equilibria require us to assume that all payoffs 
in a given game lie in some fixed bounded range; usually the interval [0, 1] is assumed. Games 
whose values lie outside this this range can be have their payoffs resealed into [0, 1] without af- 
fected the strategic aspects of the game. With this in mind, we resale the payoffs of an arbitrary 
competitiveness-based ranking game as follows. We may assume that the number of prizes is equal 
to d, the number of players. This is without loss of generality — if there are more prizes than play- 
ers, then only d of them can be awarded (so all but the first d of them can be discarded without 
changing the game). Also, if there are fewer prizes than players, we can just add additional prizes 
of value 0. (Thus, when we say that we have a game with only j prizes, what we really mean is 
that Uj+i = ■ ■ ■ = Ud = 0.) The first step in the preprocessing is to ensure that Ud = 0. This can be 
done, without changing the strategic aspect of the game, by subtracting Ud from all prizes. Next, 
we ensure that u\ = 1. This can be done, without changing the strategic aspect of the game, by 
dividing all prizes, and all costs, by u\. Next, for each player i, we ensure that c\ = 0. This can 
be done, without changing the strategic aspect of the game, by subtracting c\ from all of the costs 
of player i. Finally, we may assume that no player has an action with a cost greater than 1, since 
such an action would be dominated by a\. 

Note also that the numerical values of the scores s*- may be modified without affecting the 
payoffs and Nash equilibria of the game, provided only that the modification does not affect which 
are greater than which (in which case the ranking of the players is preserved). However, it is usually 
convenient to specify numerical Sj values (rather than, more abstractly, their ordinal relationships) 
when describing a game. 

Finally, we establish a useful fact that will be used to obtain polynomial-time algorithms that 
return approximate Nash equilibria. 
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Observation 1 For any e > inverse of an integer, given a probability vector x = (xi, . . . ,x n ), it 
is possible to define a probability vector x = (x±, . . . ,x n ), called an "e-rounding" of x, in which 



1. each entry is equal to a non-negative integer multiple of e, and 



2. For every j £ {1, . . . , n}, the rounding error YjL=i( x k ~ x k) 



is less than e. 



Proof. We consider the values x\,...,x n in order. When we consider Xj, we round it to define Xj. 
If Xj is an integer multiple of e, then xj = Xj. Otherwise, we set the value of Xj by rounding Xj - 
we round up to the nearest integer multiple of e if Ylk<i Xk — ^2k<j x k an d we round down to the 



nearest integer multiple of e otherwise. This ensures that Yjk=i( x k ~~ x k) 
We now show that x is a probability vector, that is, that X^fc=i x k 
probability vector, 



< e. 

= 1. Now since x is a 




1 



Xk, 



^2( x k 

k=l 

and we already know that the latter is less than e. Since the x^s are non- negative integer multiples 
of e then J2 k Xk = ae, for some integer a. The above inequality then yields — l<a — - <1. Since 
- and a are integer numbers, we can only satisfy the previous inequality by having ^ fc Xk = 1- D 



3.2 Games without ties with a single prize 

We begin by observing that (subject to the above preprocessing) we may restrict our attention to 
actions' costs that are strictly less than 1. 

Observation 2 Assume player i has an action ah such that Cj = 1. Then a l j is weakly dominated 
by a\. Therefore, we can eliminate a l j from the game at the price of eliminating some potential 
Nash equilibria. 

Assuming that costs are strictly less than 1 , we show that Nash equilibria of games without ties 
have a nice structure when there is a single prize. Siegel [33] has shown a more general version of 
the following, in the context of continuous games. We include a proof here, since it is simpler in 
the discrete case. 



Theorem 1 Suppose there is a single prize of value 1 and actions' costs are less than 1. If no two 
actions have the same strength (thus ties are impossible) then in any Nash equilibrium 

1. There is just one player with positive expected payoff; all others have expected payoff zero. 

2. The player with positive expected payoff is the one with the strongest action with a cost of less 
than 1. 

Proof. Given the preprocessing steps noted above and the assumption that the single prize has 
value 1, the costs of all actions lie in the range [0, 1), and each player has an action with cost 0. 
Let N be a Nash equilibrium. For each player i let Wi be the weakest action of i that lies in the 
support of M; thus, i has positive probability of using Wi, and all other actions that i uses with 
positive probability are stronger than tUj. 

For any action a, let s(a) denote the score of that action. Let p be the player whose weakest 
action in the support of M is stronger than all other players' weakest actions in the support of AT, 
thus s(w p ) > s(w p ') for all p' ^ p. 
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Note that for any player p' ^ p, the expected payoff to p' from using action uy is non-positive: 
Wpi cannot win since p is certain to play a stronger action. But, p' gives positive probability to w p >, 
so no other action available to p' can have higher expected payoff, p' has non-positive expected 
payoff, and under the assumption (that we may adopt from preprocessing) that players all have a 
0-cost action, p h s expected payoff must in fact be zero. 

For the second part of the theorem, note that we have seen that all but one player must have 
expected payoff 0, in Nash equilibrium J\f. Let p" be the player with the strongest action having a 
cost of less than 1. Then p" can always guarantee a positive expected payoff by using that action, 
so in any Nash equilibrium, must receive positive expected payoff. Hence p" is the only player with 
non-zero expected payoff in M (and p" is in fact the same as the above player p). □ 

The following theorem also shows how a Nash equilibrium may be efficiently computed for tie- 
free games, provided that we know the support of the Nash equilibrium. This shows an interesting 
parallel between these games, and general 2-player normal form games, especially in conjunction 
with the subsequent observation that the solution is a rational number. 

Theorem 2 For games with any number of players, pure-strategy costs less than 1 and a single 
prize where ties are impossible, a Nash equilibrium can be computed in polynomial time if we are 
given the support of a solution. 

Proof. Given a game Q, suppose that we remove the pure strategies that are not in the support 
of some (unknown) Nash equilibrium. The resulting game Q' has a fully-mixed equilibrium Af, 
thus any two strategies that belong to a player have the same expected payoff in TV. Our general 
approach is to compute the probabilities x*- in descending order of strength of the associated actions 
a). 

Let a* be the strongest action in Q' (i.e. having the highest score). Player z's expected payoff 
is 1 — dj and by Theorem Q] all other players have expected payoff 0. 

Let a?, be the second-strongest action in Q'; we may assume i' ^ i since if i' = i then a 1 - would 
be strictly dominated by a 1 -,. Its expected payoff to i! is — c*', + (1 — x*-), which by Theorem[T]is 0, 
so we have an expression for x*-. Consider the third-strongest action a*'», whose payoff is given by 
—Cj'i + (1 — ~~ x j') (assuming i" ^ i) which gives us an expression for x 1 -,. 

Generally, the r-th strongest action has expected payoff — + n&:^«(l ~ a k) where aj~ is the 
sum of probabilities of player fc's actions that are stronger than a^. 

The probabilities for each player's weakest actions will be obtained from the equations that 
ensure that for every player i, the values x*- sum to 1 (are a probability distribution). □ 

Observation 3 For games where ties are impossible, if all action costs are rational numbers 
smaller than 1 then the solution is also a rational number. 

This is immediate from the expressions in the above proof that give the values x*- . 
3.2.1 Solving 2-player games exactly 

Ranking games (as in [3]) with actions that do not have the upfront costs A we consider here, are 
constant-sum, so they can be solved efficiently in the 2-player case. Our games are not constant- 
sum, but we do have an alternative polynomial-time algorithm to solve them in the 2-player case. 

Theorem 3 2-player ranking games that have competitiveness-based strategies and are without ties 
can be solved exactly in polynomial time. 
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Proof. As before, assume a single prize of 1 unit and action costs in [0, 1], which may be assumed 
by the preprocessing noted earlier. 

We can (in polynomial time) compute exact solutions of 2-player games of this type as follows. 
We start by eliminating certain dominated strategies. Specifically, suppose that for strategies a 1 - 
and the set of opponent's strategies that they win against, is the same. Then can be 

eliminated. Rename the strategies of this game [a\, . . . a n ) for the row player and (a[, . . . a' n ,) for 
the column player. Assume without loss of generality that it is the row player who has the weakest 
strategy, thus a'i wins against a\. When n = n', the strategies, arranged in ascending order of 
strength are ai, a^, a2, a' 2 , . . . a n , a' n . When instead n / n', then it must be the case that the same 
player has weakest and strongest strategy; the strategies, arranged in ascending order of strength, 
are a\, a'i, . . . , a n , a n , o n +i in this case. 

Suppose that in some Nash equilibrium M the row player does not use strategy aj for some 
j > 1 (that is, the player plays aj with probability 0). Then the column player does not use strategy 
a'j (which is the cheapest one that wins against aj) since a'- would now be dominated by a'-_ l . For 
a similar reason, the row player will not use Oj+i, the cheapest strategy that wins against a'j, so the 
column player will not use a'j +1 , and so on. This shows that (in a Nash equilibrium) the strategies 
in either player's support must be either a prefix of the sequence of his strategies or a prefix of all 
his strategies but the weakest, with strategies arranged in ascending order of strength. 

We can now try to solve for all such supports, since there are polynomial-many of them. Recall 
that a 2-player game can be solved efficiently in polynomial time if we are told the support of a 
solution, since it reduces to a linear program (see, for example, page 31 in pD]). □ 

The main property used to show Theorem [3] above (i.e., if in any solution a player does not 
play a certain strategy s then the other one does not play the strategy that "just beats" s) breaks 
down when ties are allowed. Simply consider two consecutive strategies of player 1, aj and aj +1 , 
and of player 2, aj and a| +1 such that scores of aj and a? (resp. aj +1 and a? +1 ) are the same. In 
this case, we can no longer conclude that in any solution if player 1 does not play aj then player 
2 does not play a| +1 as such a strategy is used not just to beat aj but also to share with aj +1 . 
This shows that for games where ties may occur, we have to use different approaches to obtain 
polynomial-time algorithms for them. 

3.3 Exact algorithm for linear-prize ranking games 

Consider a (i-player n-strategies-per-player ranking game Q with competitiveness-based strategies 
without ties in which the prize for ranking /c-th is a linear function a — kb, for some values a and 
b. We call Q a linear-prize ranking game. We claim that we can represent Q as a poly matrix 
game [15]. A poly matrix game can be represented as a graph: players are the verticals and a 
player's payoff depends on the actions of his neighbors. The edges are 2-player zero-sum games. 
Once all players have chosen a strategy, the payoff of each player is the sum of the payoffs of the 
games played with his neighbors. Dashikis and Papadimitriou [15] give an algorithm which solves 
polymatrix games in polynomial time. 

We can express Q as a polymatrix game as follows. We define a complete (d + l)-vertex graph 
where the additional vertex encodes an external player N, which we call "nature". For players i 
and i' in Q, edge (i,i') is an n x n 2-player constant-sum game. (This is a zero-sum game, shifted 
by a constant — this shifting can be accommodated in polymatrix games.) The game on edge (i, i') 
"punishes" the lowest-ranked player amongst i and i! . In particular, in the matrix of this game, 
the entry will have payoff for player i and —b for player i' if and only if s l j > s*,. Edge 

(i, N) is an n x 1 2-player zero-sum game in which nature "gains" what player i is paying in effort 
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minus a — b. In particular, each entry j will have payoff a — b — c^ for player i and payoff cj — a + b 
for nature. Note that once all players of Q have chosen strategies, the payoff to the player who is 
ranked /c-th is a — kb minus his cost. He loses b to each of the k — 1 players who beats him and he 
gains a — b and loses his cost of effort, to nature. Thus, using the algorithm in [15], we obtain the 
following theorem. 

Theorem 4 There is a polynomial-time algorithm that computes a Nash equilibrium for linear- 
prize ranking games. 

3.4 Games where ties are possible 

We consider a more general situation in which two or players may have pure strategies having the 
same score, and if they play those strategies, any prizes are shared. We begin by showing that 
we can study without loss of generality Nash equilibria of competitiveness-based ranking games in 
which scores are symmetric. That is, players have a shared set of n pure strategies, a\, . . . ,a n . As 
above, player i has player-dependent costs c\ < ... < c l n , but the scores are player-independent, 
and we denote them s\ < . . . < s n . 

3.4.1 Reduction to score-symmetric games 

The reduction preserves Nash equilibria of the original game (and thus is a Nash homomorphism) 
and is presented for the case of 2-player games. The generalisation to the d-player case is straight- 
forward, although the number of strategies per player would increase by a factor of d. 

Suppose that player 1 has action aj and player 2 has no action with score equal to sj. We give 
player 2 a weakly dominated strategy with score sj — if a\ is the weakest strategy of player 2 that 
has higher score than aj, give player 2 an additional strategy with cost c\ and score sj. If player 
2 does not have a stronger strategy than aj, give player 2 an additional strategy with cost 1 and 
score sj. 

We can assume that each player has n strategies a\, . . . , a n with scores si, . . . , s n and costs 
c\, . . . , c„ for player 1 and cf , . . . , c 2 n for player 2. Suppose we solve this game, and now we have 
to recover a solution to the original game before the weakly dominated strategies were added. 
To do this, each player just has to replace their usage of any weakly dominated strategy by the 
corresponding weakly dominating one. Let a,- be a weakly dominated action of player 1; a& denotes 
the corresponding weakly dominating action, k > j and then Sk > Sj. The probability of player 1 
playing aj at equilibrium is then added to his probability of playing . This raises the question of 
whether player 2 may be given an incentive to deviate as a consequence. Such an incentive can only 
concern actions with scores within the interval / := [sj, s^] (for actions with score outside /, player 
2 has the same payoff when player 1 plays either Oj or a& since in these cases the ranking of the 
players do not change). First, note that if in the Nash equilibrium player 2 plays actions with scores 
in / with positive probability then player 1 plays aj with probability 0. This is because player 1 
strictly prefers a^ to aj when player 2 plays actions with score in / and is indifferent between them 
for the remaining actions. Therefore, we do not need to redistribute probability mass in this case. 
On the other hand, whenever in the Nash equilibrium the actions having score in / are played by 
player 2 with zero probability then the change of the probability distribution of player 1 has no 
effect on player 2; as observed above, for each action with score outside /, player 2 has the same 
payoff when player 1 plays either aj or a^ and then he has no incentive to deviate. 
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3.4.2 Score-symmetric games and pure equilibria 

Unlike games without ties, for which 2-player 2- action games might not possess pure equilibria (see 
Example [1]), score-symmetric games in which players have only 2 strategies do have pure Nash 
equilibria (for any number of players and any number of prizes). 

Theorem 5 2-action competitiveness-based score-symmetric ranking games do have pure Nash 
equilibria (any number of players; any action costs for individual players). Furthermore, a pure 
Nash equilibrium can be found in polynomial time. 

Proof. We have 2 pure strategies a\ and 02, where a\ is less competitive; thus for each player i we 
have c\ < c\. Recall that by preprocessing we may assume that c\ = 0, so that c\ is non-negative, 
for all i. We show how to identify a pure Nash equilibrium that consists of a (potentially empty) 
set of players playing 02, all of whom have a cost for playing a<i lower than the ones playing a\. 

We may assume that the players are indexed in non-decreasing order of their cost of playing 
d2, so that for 1 < i < n we have c\ < ($2 . Now, let Zi denote the pure profile in which the 
first % players play 02 and the remaining d — i players play a\. We claim that if i has an incentive 
to deviate from then no player i' has an incentive to deviate from Zi, with i' < i. Indeed, 
if i has an incentive to deviate from Zi~x, then Ui ^_' i ^ d , his utility in Zi—x, is strictly less than 
m+.„+Hi _ ^ utility in Zi. Since, by definition, c 2 < c 2 , for i' < i, the previous inequality 
implies that player i' is better off by sticking to 02 in Z{. Therefore, starting from the profile Zq 
in which all players play ax, we initially check whether player 1 has an incentive to deviate. If not, 
the profile is a pure Nash equilibrium, otherwise we let him deviate and we have the profile Z\ . We 
can reiterate this process for each player % in this ordering until we reach the break-even point at 
which the share of the prize obtained from playing 02 goes down below the cost of the next player 
in line. This shows the existence of the claimed pure Nash Equilibrium, and also constitutes an 
efficient algorithm for finding it. □ 

A related result is known for symmetric games in which players have only 2 strategies: These 
games always have a pure Nash equilibrium [TO] . The above theorem concerns games that are 
anonymous but not symmetric (as costs are player-specific). However, let us notice that the argu- 
ments used in [1] to prove the result about symmetric games appear to be similar to ours. It is 
easy to see that these arguments fail when two players have three strategies available, as shown by 
the next example. 

Example 4 We have 2 players and 3 actions, namely ax, 02 and 03 ordered increasingly by score, 
i.e., si < S2 < S3. The prizes are u\ = 1 and ui = 0, U3 = 0. Costs are c\ = for i = 1,2, 
c\ = I , c\ = I and c\ = \,c\ = | . Thus, we have payoff matrix 



It is easily checked that this game has no pure Nash equilibrium. The unique Nash equilibrium of 
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3.4.3 PTAS for many players who share a fixed set of strategies 

Consider a score-symmetric ci-player game, each of whom have actions ai,...,a n with scores 
s\, . . . ,s n . In this section we view the number of actions n as a constant, and we are interested 
in algorithms whose runtime has polynomial dependence on d, the number of players. A score- 
symmetric game is a special case of an anonymous game, so it is possible to directly apply a result 
of Daskalakis and Papadimitriou [H] to show that it has a PTAS. Here, we give a conceptually 
simpler PTAS. 

In the PTAS, we first round the cost vectors of the players. This does not introduce much error, 
but it ensures that there are only a constant number of different cost vectors. We refer to the cost 
vector of a player as its "type" . Now the point is that players of the same type are equivalent in 
the following sense — once we know how many players of each type adopt each (mixed) strategy, 
we can examine the resulting strategy profile to check whether it is an e-Nash equilibrium. It is 
only important how many players of each type adopt a particular strategy — it is not important 
which players they are. Thus, brute-force search is quite efficient. Technically this algorithm is 
oblivious in the sense of Daskalakis and Papadimitriou [16], in that it constructs a polynomial-sized 
set of mixed-strategy profiles in such a way that at least one of them should be an approximate 
equilibrium, and checks each of them. 

By a k- composition of a positive integer N, we mean a solution to N± + . . . + Ni~ = N in which 
N\, . . . ,Nk are non-negative integers. There are less than N k such solutions. 



Algorithm 1: PTAS for score-symmetric d-player games having a constant number, n, of 
pure strategies and a given accuracy parameter e > 

1 Let 5 = e/n and £ = [l/e~|. 

2 For each player, round each cost c*- down to the nearest non- negative integer multiple of e. 
Two players have the same type if they have the same set of rounded costs. Let T be the set 
of types; note that |T| < {t + 1)". 

3 Let S be the set of n-dimensional probability vectors {(x±, . . . , x n )} in which each Xj is a 
non-negative integer multiple of 5. (Mixed strategies for individual players will be sought 
from elements of S.) Let s = \S\, and note that s < (ni + l) n . 

4 Perform a brute-force search as follows. For every type t € T, consider every s-composition 
dt = dt,i + • • • + d ttS of the a\ players of type t. Consider the strategy profile in which, for all 
t £ T and all j £ {1, . . . , s}, dtj players of type i play the j-th strategy in S. Check whether 
this strategy profile is an e-Nash equilibrium. 

5 Return an e-Nash equilibrium if one is found. 



Theorem 6 For any constant e > and any constant n, Algorithm[l\ returns a 2e-Nash equilibrium 
in time polynomial in d. 

Proof. It is convenient to assume in the proof that e (and hence 5) is the inverse of an integer. 
This can be assumed by rounding e down to the nearest such fraction e', obtaining an e'-Nash 
equilibrium, which is consequently an e-Nash equilibrium. 

Consider first the case in which e is the inverse of an integer. Since costs lie in the range 
[0, 1] we can see that after Step [2] is executed there are less than (£ + l) n distinct player types. 
We will find an e-Nash equilibrium of the rounded game, which is a 2e-Nash equilibrium of the 
original game since each cost is only changed by at most an additive e. Note that |T| = 0(1) as a 
function of d, the number of players. Also, the number s of strategies in S constructed in Step [3] 
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is 0(1) as a function of d. Since the number of s-compositions of a\ (StepEJ) is 0((dt) s ), the total 
number of mixed-strategy profiles considered in Step H] is at most 0(Y\ te j-(dt) s ) = 0(d s ^) which 
is polynomial in d. 

We now show that the algorithm always finds an e-Nash equilibrium in the last step. First, 
consider the game with rounded cost vectors, as constructed in Step [21 and let N be a Nash 
equilibrium of this game. Consider a ^-rounding (as from Observation [T]) of each probability vector 
of j\f and notice that such a probability vector is checked by the algorithm (or an equivalent one 
is, in which the identities of players of the same type are swapped). We next show that such a 
probability vector is an e-Nash equilibrium. A player can be playing an action with a probability 
that differs by at most S from the probability he should have used for his best response. Thus for 
each of his actions, he can lose at most 5 times the payoff he gets for that action. Since payoffs are 
upper-bounded by 1, the maximum regret is at most n5 = e. The proof concludes by noting that 
the cost rounding of Step [2] implies an extra additive error of at most e. □ 

3.4.4 FPTAS for constant number of players, many strategies 

Let Q be a score-symmetric game with d players having access to n (shared) pure strategies. In 
this section we view the number of players d as a constant, and the number of pure strategies n is 
the parameter that governs the size of a game. Let {1, . . . , d} denote the players. We have d prizes 
of values u\ > U2 > • • • > itrf, where U\ = 1 and Ud = 0. 

In this case, the expected payoff of player i from playing aj is given by the expected prize he 
gets minus his cost c* of dj. To define the expected prize that the player gets by playing a.j we 
need some notation. Let B = {—1,0, 1} and v = (vi,V2, ■ ■ ■ , Urf) € B d be a vector that is defined 
with reference to some aj as follows. Vk gives information about the pure strategy played by player 
k G {1, . . . , d}, where Vk = means that player k plays a,-, Vk = —1 means that player k plays one 
of the actions that are less competitive than aj (that is, one of a±, . . . , a,j-i), and finally, = 1 
means that k plays one of the more competitive actions than aj (one of Oj+i, . . . , a n ). Given vector 
v G B d we let v(l) be the number of l's in v, i.e., v(l) = \{vi : Vi = 1}|; similarly, v(0) (v(— 1), 
respectively) denotes the number of O's (—l's, respectively) in v; thus, v{— 1) + v(0) + v(l) = d. 

Now, let x l j be the probability that player i\ plays aj, and let i2,...,id be the remaining 
players. Observe that, if player i\ plays purely aj, i.e., v\ = 0, then prizes u\,U2, ■ ■ ■ ,u v ru will be 
given to v(l) players playing more competitive actions than aj, and prizes u q+ \,u q+ 2, ■ ■ ■ ,Ud are 
reserved for the v(— 1) players playing less competitive actions than aj, where q = d — v(—l). Thus, 
the total value of prizes to be shared among the v(0) players playing aj (including player ii) is 

U v (l) + 1 + Uv(l)+2 H \~ u q- 

We also denote by 1) = YjgZi x \ andp* (l) = Y17=j+l x e> * ne probabilities of player i playing 
actions that are less (respectively, more) competitive than aj; furthermore, let p l j(0) = x % y To cover 
degenerate cases we also assume that p\(— 1) = 0, i.e., the probability of i playing an action less 
competitive than a\ (the weakest action) is zero; similarly, p^(l) = 0. Then the expected payoff of 
player i\ for playing aj, denoted as 7T* 1 , is given by 

v=(v 1 =0,v 2 ,...,v d )£{0}xB d - 1 \k=2 / 

To compute a Nash equilibrium, we need to find real values x\, . . . , x % n , fori € {1,2,... ,d}, that 
satisfy 

4>0Vi,j; ]>>; = 1 i€{l,2,...,d} (4) 

j 
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saying that for i € {1,2, ... , d}, the values {xfij are a probability distribution; for i G {1, 2, . . . , d} 
and j > 1 the following should also hold: 

ttJ > maxfc=i,...j_i{7r|} ==> x\ = ...= x)_ x = 0, 
?rj < max fe= i v ..j_i{7r|} ==> xj = 0. 

Lemma 1 XTie values x* satisfy ^ and f^j if arad only if i/iey are a iVas/i equilibrium. 

Proof. The sets {x*}j, for i € {1,2, . . . , d}, are constrained by ([4]) to be probability distributions. 

We claim that expressions ([5]) are equivalent to the definition of Nash equilibrium constraints ([1]) . 
If a,j give player i a higher payoff that all previous (weaker) strategies then none of those may be 
in player i's support. Similarly, when a,j gives a lower payoff than a weaker strategy, aj will not 
be in the support. Note that if a,j gives a higher payoff than the weaker actions but a lower payoff 
than a stronger strategy dji then the probability of playing aj will be set to when tt 1 -, is compared 
with maxjt = x i ... ) , i ... ):7 '_i{7r^}. □ 

Consequently we have reduced the problem to satisfying the constraints (j4]) and ([5]). We now 
define variables in addition to x's and 7r's with the aim of expressing @ and ([5]) in terms of a 
constant number of "local" variables. This is needed to define our FPTAS. Let <r*- be the partial 
sum Yle=i x \- We can now express (jH) as follows: 



a\ = x \ < a) < 1 a)_ x + x) = a) 
4 = 1 < x) < 1, 



(6) 



and observing pU—1) = a^x, Pj-(0) = x l - and Pj(l) = I — (J 1 -, we can now express it* 1 in ([3]) only in 
terms of variables cr^ 2 _i, ■ ■ ■ , a ) d -n X J ■> ■ ■ ■ > x j- d i ^j 2 ' ' ' ' ' ^ as 



»i = ii, ^(i )+ i + ■ ■ ■ + n g fa i2 ^ . i i i 

ve{o}xB d - 1 v ; 



where function ^ r j 1 (-) is the product rifc=2^j fe (' y fc) written in terms of these variables. Observe that 
if j = 1, then <t* 2 _ 1 = • • • = <r* d _ 1 = 0, and if j = n, then a l ? = ■ ■ ■ = <7* d = 1; thus, in these cases 
function ^ r *- 1 (-) does not depend on these respective variables. 

Additionally, let ofy be the maximum expected payoff player i can get by playing one of a\, . . . , ctj, 
i.e., a l j = maxfc = i j ... j j{7r[.}. We can now define 



and express ([5]) as follows: 



: ir\ a l j = max{a 1 j_ 1 ,TTj} (8) 

tt) < a} , > = 0. ' 



Observation 4 XTie values and ttj satisfy (SS^MW V an d on ^V the values x*- are a 

Nash equilibrium. 

Now consider the sequence 

S = (it j , . . . , TTj , Xj , . . . , Xj , Oij , . . . , Oij , (Jj , . . . , (Jj )j = l,. ,.,n- 
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Constraints in ([6]) involve 3 variables that are at distance at most Ad + 1 in S (namely, for j > 1 , 
is followed by 4d elements of S -including Xj- and then by cr*). Constraints ([7]) on the other 
hand involve variables that are at distance at most 5d in S. It is easy to check that the same 
happens also for the other constraints and conclude then that the following holds. 

Observation 5 For any j = l,...,n, for each constraint in |j|), |?p, |2jj and (0|) there are 5d 
consecutive elements of S that contains the quantities involved in the constraint. 

The algorithm. For e > according to (J2J) we relax the constraints of (|9j) as follows: 

1 >a r 1+e l- 1 ^ 0, do) 

n j < ~ e ^ x 3 = °- 

Let Si be the sequence of 5d consecutive elements of S that begins at the i-th element of S. 
Let Si be the set of expressions in ([6]), (J7|), j8|) and (fTUj) that relate elements of Si with each other; 
by Observation [5] the union of the sets Ei is all constraints © , (J7|) , dH|) and (fT0|) . The algorithm 
(Algorithm [2]) works its way through the sequence S left-to-right, and for each Si identifies a subset 
of ([0, representing possible values of those quantities that form part of an approximate Nash 
equilibrium. We call this subset Di. Then it sweeps through the sequence right-to-left identifying 
allowable values for previous elements. The parameter e controls quality of approximation. 

Algorithm 2: FPTAS for score-symmetric (i-player games (where d is constant) having a 
variable number, n, of pure strategies and accuracy parameter e > 

1 For each player, round each cost A down to the nearest non-negative integer multiple of 5, 
where 5 = 

2 For 1 < % < 4dn — 5d + 1, let Di be the set of all 5d-dimensional vectors that are calculated 
as follows: 

a. Generate all non- negative integer multiples of 5 for the x's in Si. 

b. For each collection of such values for the x's use the constraints from ([6]) in £ j to 
calculate the corresponding values of the cr's in Si. 

c. For each collection of values of x's and cr's use the constraints from (|7|) in Ei to 
calculate values of the 7r's in Si. 

d. Calculate the a's values by applying the constraints from (JSj) in Ei. 

e. Among all computed vectors computed above, keep in Di only those that fulfill the 
constraints from ()10p in Ei. 

3 For i > 1 (in ascending order) keep in Di only vectors s for which there is at least one vector 
s' in such that the first 5d — 1 entries of s are the same as the last 5d — 1 entries of s' . 

4 Let S4dn-5d+i be a point in D^dn-sd+i- For 1 < i < 4dn — 5d + 1 (in descending order) let 
be a point in Di chosen so that its last 5d — 1 coordinates are the first 5d — 1 coordinates of 

Sj+l- 

5 Let s be the vector of length Adn such that Sj is the i-th sequence of 5d consecutive 
coordinates of s. Set x*- to the entry of s that corresponds to the position of x % - in S. 



Theorem 7 There is a FPTAS for computing e-Nash equilibria of competitiveness-based ranking 
games with a constant number of players. 

When the number of players is a constant, we may assume that the games is score-symmetric 
by applying the reduction of Section 13.4.11 which is then achieved at the price of a constant-factor 
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increase in the number n of strategies per player. We show that Algorithm [2] is indeed a FPTAS 
for this class of games. The proofs assume that e is the inverse of an integer. (Similarly to above, 
if this is not the case we simply run the algorithm with an e' < e which is inverse of an integer.) 
Theorem [7] will follow from the following two propositions and the subsequent observation about 
the runtime. 

Proposition 1 (Approximation guarantee) If Algorithm^ finds a vectors in Step\£^ then the 
values x l j correspond to a (n + 2)e-Nash equilibrium. 

Proof. The entries of s (including the subset corresponding to Xj) satisfy ©, (|7|), (J8|) and (fT0j) , 
where (jlOp simply rewrites the definition of e-Nash equilibrium ([2]) thus implying that we are losing 
an additive e. Another additive loss smaller than e is due to the cost rounding. Furthermore, we 
are restricting to probability distributions whose values are non-negative integer multiples of 5. 
Thus, a player may be forced to play a strategy with a probability that differs by at most 5 from 
the probability of his best response. This may impose an additional additive error of n5 < ne in 
the worst case (this is because we have n actions and on each of them the best response is at most 
5 different, while the payoffs are upper bounded by 1). □ 

Proposition 2 (Correctness) Algorithm^ always finds a vectors in Step\^ 

Proof. Consider a Nash equilibrium W and the associated vector s. Take a Grounding x (5 = 
e/(4d 2 ■ 3 d )) for each probability vector x in s. Use (0) to define the corresponding rounded vector 
a. Given x and <r, use ([7]) and ([8]) to define values of tt and a, respectively; the newly-obtained 
vector of rounded values is denoted as s. Observe that s is considered by the algorithm. It thus 
suffices to show that such a sequence satisfies all the constraints that the algorithm imposes on the 
output. 

By construction, ©, (J7]) and (|8|) are fulfilled. For constraint ([TO]) we show that fr*- > fi^j + e 
crj-_i = 0. (Very similar arguments can be used to show the other condition of (IIOD .) 

We will show first that | j/J ~ Vj\ < e /2 for y € {it, a}. We will only give the details for y = tt 
as the argument is similar in the other case. Let us first focus on the product Y\k=2Pj k ( v k) m © 
in which the factors crV^, 1 — a 1 -' and x 1 -* are involved. Now, the i's are defined as <5-roundings 
of the corresponding cc's (according to Observation [1]) . The quantity <t*- is the sum of the first j 
entries of x l . Thus, Observation Q] allows us to deduce that the rounded values GjLn 1 — crV° and 
x l j k are less than 5 away from ffjLi, 1 — 0j- fc and x* fe in absolute value, respectively. In other words, 

\p] k ( v k) -Pj k (vk)\ < S for any i k and j. 
Then we have 

d d d d 

X\p) k {v k ) -\\p) k {y k ) < Y[(pj(v k ) + 6) - Hpf(v k ). 

Jc — 2 /c — 2 k — 2 k — 2 

An upper bound for the right-hand side is obtained by setting p l j° (v k ) = 1, resulting in an upper 
bound of (1 + (5) rf_1 — 1. 5 was chosen sufficiently small to ensure that this is at most e/(4 • S^" 1 ). 
Now observe that the component M "( 1 )+ 1+M ^w+ 2+ +Up m Q h as value at most one and there are at 

most 3 d ~ l terms in the summation of ([3]), which implies that the difference between the summations 
for TTj 1 and 7T* 1 is strictly smaller than e/4. We have \-k 1 ^ — 7T* 1 1 < e/2. 

Thus we have shown that — yl\ < e/2 for y € {vr,a} and so —e/2 < y % - — y l - < e/2 for 
y £ {7r,a}. Therefore ff*- > + e implies that 7r!- > o;!_i and as s is a Nash equilibrium, 
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by Observation U] and Q, we have <7j_i = 0. But then by the way we define cr's we have that 
SJ_! = 0. □ 

Runtime. Since the values of cr's, 7r's and a's are calculated applying ([6]), ((7|) and © respectively 
for given x's, the sets -Dj are of size 0((l/5) d ) = 0((l/e) d ), so for constant d the runtime of the 
algorithm is indeed polynomial in n and 1/e, as required for a FPTAS. 

Remarks. Algorithm [2] is somewhat similar to the algorithm of [24j for solving tree-structured 
graphical games. They give a similar forward-and-backward dynamic programming approach to 
solving these games; their algorithm takes exponential time for exact equilibria |17| but a similar 
quantisation of real- valued payoffs leads to a FPTAS. 

4 Conclusions and further work 

Our FPTAS can be used to compute exact equilibria in certain cases. When a game with constantly- 
many players has payoffs that are multiples of some e > then we can compute exact Nash equilibria 
in time polynomial in the size of the input and 1/e by simply using the FPTAS. This observation 
raises the open problem of determining whether there is a polynomial-time algorithm for solving 
2-player (competitiveness-based ranking) games in general when ties are possible and the prize is 
shared in the event of a tie. 

Several other concrete open problems have been raised by the current results, for example, fully 
quantifying the complexity of computing Nash equilibria for competitiveness-based ranking games. 
Also, in situations where multiple equilibria may exist, we would like to know whether a specific 
equilibrium is selected by some natural decentralized dynamic process. 

Acknowledgements. We thank David Gill, Milan Vojnovik and Yoram Bachrach for pointers to 
related work. 
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